Lookahead And Latent Learning In ZCS
نویسنده
چکیده
Learning Classifier Systems use reinforcement learning, evolutionary computing and/or heuristics to develop adaptive systems. This paper extends the ZCS Learning Classifier System to improve its internal modelling capabilities. Initially, results are presented which show performance in a traditional reinforcement learning task incorporating lookahead within the rule structure. Then a mechanism for effective learning without external reward is examined which enables the simple learning system to build a full map of the task. That is, ZCS is shown to learn under a latent learning scenario using the lookahead scheme. Its ability to form maps in reinforcement learning tasks is then considered. keywords: animat, genetic algorithm, internal models, learning classifier system, multiplexer task.
منابع مشابه
Learn Ing Class If Ier Systems
Learning Classifier Systems use reinforcement learning, evolutionary computing and/or heuristics to develop adaptive systems. This paper extends the ZCS Learning Classifier System to improve its internal modelling capabilities. Initially, results are presented which show performance in a traditional reinforcement learning task incorporating lookahead within the rule structure. Then a mechanism ...
متن کاملZCS: A Zeroth Level Classifier System
A basic classifier system, ZCS, is presented which keeps much of Holland’s original framework but simplifies it to increase understandability and performance. ZCS’s relation to Q-learning is brought out, and their performances compared in environments of two difficulty levels. Extensions to ZCS are proposed for temporary memory, better action selection, more efficient use of the genetic algorit...
متن کاملUWELCSG01-009 An Investigation into LCS Road Traffic Junction Controllers
XCS and later ZCS are considered as adaptive intelligent controllers for road traffic networks. A simple traffic network simulator is employed and results are presented showing that within this environment both classifier system models are capable of learning reasonable policies. It is concluded that for reasons discussed, ZCS may be the more suitable system for this class of problem. Initial t...
متن کاملTournament selection in zeroth-level classifier systems based on average reward reinforcement learning
As a genetics-based machine learning technique, zeroth-level classifier system (ZCS) is based on a discounted reward reinforcement learning algorithm, bucket-brigade algorithm, which optimizes the discounted total reward received by an agent but is not suitable for all multi-step problems, especially large-size ones. There are some undiscounted reinforcement learning methods available, such as ...
متن کاملDistinct responses of Purkinje neurons and roles of simple spikes during associative motor learning in larval zebrafish
To study cerebellar activity during learning, we made whole-cell recordings from larval zebrafish Purkinje cells while monitoring fictive swimming during associative conditioning. Fish learned to swim in response to visual stimulation preceding tactile stimulation of the tail. Learning was abolished by cerebellar ablation. All Purkinje cells showed task-related activity. Based on how many compl...
متن کامل